Stiff Equation
   HOME

TheInfoList



OR:

In
mathematics Mathematics is an area of knowledge that includes the topics of numbers, formulas and related structures, shapes and the spaces in which they are contained, and quantities and their changes. These topics are represented in modern mathematics ...
, a stiff equation is a
differential equation In mathematics, a differential equation is an equation that relates one or more unknown functions and their derivatives. In applications, the functions generally represent physical quantities, the derivatives represent their rates of change, an ...
for which certain
numerical methods Numerical analysis is the study of algorithms that use numerical approximation (as opposed to symbolic manipulations) for the problems of mathematical analysis (as distinguished from discrete mathematics). It is the study of numerical methods th ...
for solving the equation are numerically unstable, unless the step size is taken to be extremely small. It has proven difficult to formulate a precise definition of stiffness, but the main idea is that the equation includes some terms that can lead to rapid variation in the solution. When integrating a differential equation numerically, one would expect the requisite step size to be relatively small in a region where the solution curve displays much variation and to be relatively large where the solution curve straightens out to approach a line with slope nearly zero. For some problems this is not the case. In order for a numerical method to give a reliable solution to the differential system sometimes the step size is required to be at an unacceptably small level in a region where the solution curve is very smooth. The phenomenon is known as ''stiffness''. In some cases there may be two different problems with the same solution, yet one is not stiff and the other is. The phenomenon cannot therefore be a property of the exact solution, since this is the same for both problems, and must be a property of the differential system itself. Such systems are thus known as ''stiff systems''.


Motivating example

Consider the
initial value problem In multivariable calculus, an initial value problem (IVP) is an ordinary differential equation together with an initial condition which specifies the value of the unknown function at a given point in the domain. Modeling a system in physics or ...
The exact solution (shown in cyan) is We seek a
numerical solution Numerical analysis is the study of algorithms that use numerical approximation (as opposed to symbolic manipulations) for the problems of mathematical analysis (as distinguished from discrete mathematics). It is the study of numerical methods th ...
that exhibits the same behavior. The figure (right) illustrates the numerical issues for various numerical integrators applied on the equation. One of the most prominent examples of the stiff
ordinary differential equation In mathematics, an ordinary differential equation (ODE) is a differential equation whose unknown(s) consists of one (or more) function(s) of one variable and involves the derivatives of those functions. The term ''ordinary'' is used in contrast w ...
s (ODEs) is a system that describes the
chemical reaction A chemical reaction is a process that leads to the IUPAC nomenclature for organic transformations, chemical transformation of one set of chemical substances to another. Classically, chemical reactions encompass changes that only involve the pos ...
of Robertson: If one treats this system on a short interval, for example, t \in , 40/math> there is no problem in numerical integration. However, if the interval is very large (1011 say), then many standard codes fail to integrate it correctly. Additional examples are the sets of ODEs resulting from the temporal integration of large chemical reaction mechanisms. Here, the stiffness arises from the coexistence of very slow and very fast reactions. To solve them, the software packages KPP and Autochem can be used.


Stiffness ratio

Consider the linear constant coefficient inhomogeneous system where \mathbf y, \mathbf f \in \mathbb^n and \mathbf A is a constant, diagonalizable, n \times n matrix with eigenvalues \lambda_t \in \mathbb, t = 1, 2, \ldots , n (assumed distinct) and corresponding eigenvectors \mathbf c_t \in \mathbb^n, t = 1, 2, \ldots , n . The general solution of () takes the form where the \kappa_t are arbitrary constants and \mathbf g(x) is a particular integral. Now let us suppose that which implies that each of the terms e^ \mathbf c_t \rightarrow 0 as x \rightarrow \infty , so that the solution \mathbf y(x) approaches \mathbf g(x) asymptotically as x \rightarrow \infty ; the term e^ \mathbf c_t will decay monotonically if \lambda_t is real and sinusoidally if \lambda_t is complex. Interpreting x to be time (as it often is in physical problems), \sum_^n \kappa_t e^ \mathbf c_t is called the ''transient solution'' and \mathbf g(x) the ''steady-state solution''. If \left, \operatorname(\lambda_t)\ is large, then the corresponding term \kappa_t e^ \mathbf c_t will decay quickly as x increases and is thus called a ''fast transient''; if \bigl, \operatorname(\lambda_t) \bigr, is small, the corresponding term \kappa_t e^ \mathbf c_t decays slowly and is called a ''slow transient''. Let \overline, \underline \in \ be defined by so that \kappa_t e^ \mathbf c_t is the fastest transient and \kappa_t e^ \mathbf c_t the slowest. We now define the ''stiffness ratio'' as


Characterization of stiffness

In this section we consider various aspects of the phenomenon of stiffness. "Phenomenon" is probably a more appropriate word than "property", since the latter rather implies that stiffness can be defined in precise mathematical terms; it turns out not to be possible to do this in a satisfactory manner, even for the restricted class of linear constant coefficient systems. We shall also see several qualitative statements that can be (and mostly have been) made in an attempt to encapsulate the notion of stiffness, and state what is probably the most satisfactory of these as a "definition" of stiffness. J. D. Lambert defines stiffness as follows:
If a
numerical method In numerical analysis, a numerical method is a mathematical tool designed to solve numerical problems. The implementation of a numerical method with an appropriate convergence check in a programming language is called a numerical algorithm. Mathem ...
with a finite region of absolute
stability Stability may refer to: Mathematics *Stability theory, the study of the stability of solutions to differential equations and dynamical systems ** Asymptotic stability ** Linear stability ** Lyapunov stability ** Orbital stability ** Structural sta ...
, applied to a system with any
initial conditions In mathematics and particularly in dynamic systems, an initial condition, in some contexts called a seed value, is a value of an evolving variable at some point in time designated as the initial time (typically denoted ''t'' = 0). For ...
, is forced to use in a certain interval of integration a step length which is excessively small in relation to the smoothness of the exact solution in that interval, then the system is said to be ''stiff'' in that interval.
There are other characteristics which are exhibited by many examples of stiff problems, but for each there are counterexamples, so these characteristics do not make good definitions of stiffness. Nonetheless, definitions based upon these characteristics are in common use by some authors and are good clues as to the presence of stiffness. Lambert refers to these as "statements" rather than definitions, for the aforementioned reasons. A few of these are: # A linear constant coefficient system is stiff if all of its
eigenvalue In linear algebra, an eigenvector () or characteristic vector of a linear transformation is a nonzero vector that changes at most by a scalar factor when that linear transformation is applied to it. The corresponding eigenvalue, often denoted b ...
s have negative real part and the stiffness ratio is large. # Stiffness occurs when stability requirements, rather than those of accuracy, constrain the step length. # Stiffness occurs when some components of the solution decay much more rapidly than others.


Etymology

The origin of the term "stiffness" has not been clearly established. According to
Joseph Oakland Hirschfelder Joseph Oakland Hirschfelder (May 27, 1911 – March 30, 1990) was an American physicist who participated in the Manhattan Project and in the creation of the nuclear bomb.
, the term "stiff" is used because such systems correspond to tight coupling between the driver and driven in
servomechanism In control engineering a servomechanism, usually shortened to servo, is an automatic device that uses error-sensing negative feedback to correct the action of a mechanism. On displacement-controlled applications, it usually includes a built-in ...
s. According to Richard. L. Burden and J. Douglas Faires,
Significant difficulties can occur when standard numerical techniques are applied to approximate the solution of a
differential equation In mathematics, a differential equation is an equation that relates one or more unknown functions and their derivatives. In applications, the functions generally represent physical quantities, the derivatives represent their rates of change, an ...
when the exact solution contains terms of the form e^, where \lambda is a complex number with negative real part. . . . Problems involving rapidly decaying transient solutions occur naturally in a wide variety of applications, including the study of spring and damping systems, the analysis of
control system A control system manages, commands, directs, or regulates the behavior of other devices or systems using control loops. It can range from a single home heating controller using a thermostat controlling a domestic boiler to large industrial c ...
s, and problems in
chemical kinetics Chemical kinetics, also known as reaction kinetics, is the branch of physical chemistry that is concerned with understanding the rates of chemical reactions. It is to be contrasted with chemical thermodynamics, which deals with the direction in wh ...
. These are all examples of a class of problems called stiff (mathematical stiffness) systems of differential equations, due to their application in analyzing the motion of spring and mass
systems A system is a group of interacting or interrelated elements that act according to a set of rules to form a unified whole. A system, surrounded and influenced by its environment, is described by its boundaries, structure and purpose and express ...
having large spring constants (physical
stiffness Stiffness is the extent to which an object resists deformation in response to an applied force. The complementary concept is flexibility or pliability: the more flexible an object is, the less stiff it is. Calculations The stiffness, k, of a b ...
).
For example, the
initial value problem In multivariable calculus, an initial value problem (IVP) is an ordinary differential equation together with an initial condition which specifies the value of the unknown function at a given point in the domain. Modeling a system in physics or ...
with m = 1, c = 1001, k = 1000, can be written in the form () with n = 2 and and has eigenvalues \overline = -1000, \underline = -1 . Both eigenvalues have negative real part and the stiffness ratio is which is fairly large. System () then certainly satisfies statements 1 and 3. Here the spring constant k is large and the damping constant c is even larger. (while "large" is not a clearly-defined term, but the larger the above quantities are, the more pronounced will be the effect of stiffness.) The exact solution to () is Equation behaves quite similarly to a simple exponential x_0 e^, but the presence of the e^ term, even with a small coefficient, is enough to make the numerical computation very sensitive to step size. Stable integration of () requires a very small step size until well into the smooth part of the solution curve, resulting in an error much smaller than required for accuracy. Thus the system also satisfies statement 2 and Lambert's definition.


A-stability

The behaviour of numerical methods on stiff problems can be analyzed by applying these methods to the test equation y' = ky subject to the initial condition y(0) = 1 with k \in \mathbb. The solution of this equation is y(t) = e^. This solution approaches zero as t\to\infty when \operatorname(k) < 0. If the numerical method also exhibits this behaviour (for a fixed step size), then the method is said to be A-stable. A numerical method that is L-stable (see below) has the stronger property that the solution approaches zero in a single step as the step size goes to infinity. A-stable methods do not exhibit the instability problems as described in the motivating example.


Runge–Kutta methods

Runge–Kutta methods applied to the test equation y' = k\cdot y take the form y_ = \phi(hk)\cdot y_n, and, by induction, y_n = \bigl(\phi(hk)\bigr)^n\cdot y_0. The function \phi is called the ''stability function''. Thus, the condition that y_n \to 0 as n \to \infty is equivalent to , \phi(hk), < 1. This motivates the definition of the ''region of absolute stability'' (sometimes referred to simply as ''stability region''), which is the set \bigl\. The method is A-stable if the region of absolute stability contains the set \bigl\, that is, the left half plane.


Example: The Euler methods

Consider the Euler methods above. The explicit
Euler method In mathematics and computational science, the Euler method (also called forward Euler method) is a first-order numerical procedure for solving ordinary differential equations (ODEs) with a given initial value. It is the most basic explicit m ...
applied to the test equation y' = k\cdot y is : y_ = y_n + h\cdot f(t_n, y_n) = y_n + h\cdot(ky_n) = y_n + h\cdot k\cdot y_n = (1+h\cdot k)y_n. Hence, y_n = (1 + hk)^n\cdot y_0 with \phi(z) = 1 + z. The region of absolute stability for this method is thus \bigl\ which is the disk depicted on the right. The Euler method is not A-stable. The motivating example had k = -15. The value of ''z'' when taking step size h = \tfrac14 is z = -15\times\tfrac14 = -3.75, which is outside the stability region. Indeed, the numerical results do not converge to zero. However, with step size h = \tfrac18, we have z = -1.875 which is just inside the stability region and the numerical results converge to zero, albeit rather slowly.


Example: Trapezoidal method

Consider the
trapezoidal method In calculus, the trapezoidal rule (also known as the trapezoid rule or trapezium rule; see Trapezoid for more information on terminology) is a technique for approximating the definite integral. \int_a^b f(x) \, dx. The trapezoidal rule works b ...
: y_ = y_n + \tfrach\cdot \bigl(f(t_n,y_n)+f(t_,y_)\bigr), when applied to the test equation y' = k\cdot y, is : y_ = y_n + \tfrach\cdot \left(ky_n+ky_\right). Solving for y_ yields :y_=\frac\cdot y_n. Thus, the stability function is :\phi(z)=\frac and the region of absolute stability is :\left\. This region contains the left half-plane, so the trapezoidal method is A-stable. In fact, the stability region is identical to the left half-plane, and thus the numerical solution of y' = k\cdot y converges to zero if ''and only if'' the exact solution does. Nevertheless, the trapezoidal method does not have perfect behavior: it does damp all decaying components, but rapidly decaying components are damped only very mildly, because \phi(z) \to 1 as z \to -\infty . This led to the concept of
L-stability Within mathematics regarding differential equations, L-stability is a special case of A-stability, a property of Runge–Kutta methods for solving ordinary differential equations. A method is L-stable if it is A-stable and \phi(z) \to 0 as ...
: a method is L-stable if it is A-stable and , \phi(z), \to 0 as z \to \infty . The trapezoidal method is A-stable but not L-stable. The
implicit Euler method In numerical analysis and scientific computing, the backward Euler method (or implicit Euler method) is one of the most basic numerical methods for the solution of ordinary differential equations. It is similar to the (standard) Euler method, but d ...
is an example of an L-stable method.


General theory

The stability function of a Runge–Kutta method with coefficients \mathbf A and \mathbf b is given by : \phi(z) = \frac, where \mathbf e denotes the vector with all ones. This is a
rational function In mathematics, a rational function is any function that can be defined by a rational fraction, which is an algebraic fraction such that both the numerator and the denominator are polynomials. The coefficients of the polynomials need not be rat ...
(one
polynomial In mathematics, a polynomial is an expression consisting of indeterminates (also called variables) and coefficients, that involves only the operations of addition, subtraction, multiplication, and positive-integer powers of variables. An exa ...
divided by another). Explicit Runge–Kutta methods have a strictly lower triangular coefficient matrix \mathbf A and thus, their stability function is a polynomial. It follows that explicit Runge–Kutta methods cannot be A-stable. The stability function of implicit Runge–Kutta methods is often analyzed using
order star Order, ORDER or Orders may refer to: * Categorization, the process in which ideas and objects are recognized, differentiated, and understood * Heterarchy, a system of organization wherein the elements have the potential to be ranked a number of ...
s. The order star for a method with stability function \phi is defined to be the set \bigl\. A method is A-stable if and only if its stability function has no poles in the left-hand plane and its order star contains no purely imaginary numbers.


Multistep methods

Linear multistep method Linear multistep methods are used for the numerical solution of ordinary differential equations. Conceptually, a numerical method starts from an initial point and then takes a short step forward in time to find the next solution point. The proce ...
s have the form :y_=\sum_^s a_i y_+h\sum_^s b_j f\left(t_,y_\right). Applied to the test equation, they become :y_=\sum_^s a_i y_+hk\sum_^s b_jy_, which can be simplified to :\left(1-b_z\right)y_-\sum_^s \left(a_j+b_jz\right)y_=0 where z = hk. This is a linear
recurrence relation In mathematics, a recurrence relation is an equation according to which the nth term of a sequence of numbers is equal to some combination of the previous terms. Often, only k previous terms of the sequence appear in the equation, for a parameter ...
. The method is A-stable if all solutions \ of the recurrence relation converge to zero when \operatorname(z) < 0. The characteristic polynomial is :\Phi(z, w) = w^-\sum_^s a_iw^ - z\sum_^s b_jw^. All solutions converge to zero for a given value of z if all solutions w of \Phi(z,w) = 0 lie in the unit circle. The region of absolute stability for a multistep method of the above form is then the set of all z \in \mathbb for which all w such that \Phi(z,w) = 0 satisfy , w, < 1. Again, if this set contains the left half-plane, the multi-step method is said to be A-stable.


Example: The second-order Adams–Bashforth method

Let us determine the region of absolute stability for the two-step Adams–Bashforth method : y_ = y_n + h \left( \tfrac32 f(t_n, y_n) - \tfrac12 f(t_, y_) \right) . The characteristic polynomial is : \Phi(w,z) = w^2 - \left(1+\tfrac32z\right)w + \tfrac12 z = 0 which has roots : w = \tfrac12 \left(1 + \tfrac32 z \pm\sqrt\right), thus the region of absolute stability is : \left\. This region is shown on the right. It does not include all the left half-plane (in fact it only includes the real axis between -1 \le z \le 0) so the Adams–Bashforth method is not A-stable.


General theory

Explicit multistep methods can never be A-stable, just like explicit Runge–Kutta methods. Implicit multistep methods can only be A-stable if their order is at most 2. The latter result is known as the second Dahlquist barrier; it restricts the usefulness of linear multistep methods for stiff equations. An example of a second-order A-stable method is the trapezoidal rule mentioned above, which can also be considered as a linear multistep method.See .


See also

*
Backward differentiation formula The backward differentiation formula (BDF) is a family of implicit methods for the numerical integration of ordinary differential equations. They are linear multistep methods that, for a given function and time, approximate the derivative of that ...
, a family of implicit methods especially used for the solution of stiff differential equations * Condition number *
Differential inclusion In mathematics, differential inclusions are a generalization of the concept of ordinary differential equation of the form :\frac(t)\in F(t,x(t)), where ''F'' is a multivalued map, i.e. ''F''(''t'', ''x'') is a ''set'' rather than a single point ...
, an extension of the notion of differential equation that allows discontinuities, in part as way to sidestep some stiffness issues *
Explicit and implicit methods Explicit and implicit methods are approaches used in numerical analysis for obtaining numerical approximations to the solutions of time-dependent ordinary and partial differential equations, as is required in computer simulations of physical pro ...


Notes


References

* . * . * . * . * . * . * . * . * . * . * . * . *. * * . * . *Stability of Runge-Kutta Method


External links


An Introduction to Physically Based Modeling: Energy Functions and Stiffness

Stiff systems
Lawrence F. Shampine and Skip Thompson
Scholarpedia ''Scholarpedia'' is an English-language wiki-based online encyclopedia with features commonly associated with open-access online academic journals, which aims to have quality content in science and medicine. ''Scholarpedia'' articles are written ...
, 2(3):2855. doi:10.4249/scholarpedia.2855 {{Authority control Numerical differential equations